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Abstract: The recently developed theory of higher-rank numerical ranges 
originated in problems of error correction in quantum information theory but 
its mathematical implications now include a quite satisfactory understand- 
ing of scalar compressions of complex matrices. Here our aim is to make 

CSj ' some first steps in the more general program of understanding normal com- 

pressions. We establish some general principles for the program and make a 

<£ detailed study of rank-two normal compressions. 
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1: Introduction 

00 

o. 

Given a linear operator T on a complex Hilbert space EL and any orthogonal 
projection P, we say that PT\p^ is a compression of T. If EI = C and T 
is represented by a matrix M G M^r (the N x N complex matrices) , a second 
matrix C represents a compression of T (or a compression of M) iff there is a 
unitary matrix U such that C is a NW corner of UMU*. If C is k x k we say 
it is a rank-fc compression of M. There is a rich history of results that allow 
^ us to identify compressions by means of intrinsic criteria. A classic example 

is the Cauchy interlacing theorem [Cau] , along with its converse [FP] , which 
may be expressed as follows. 

Theorem 1: If M G M^r is Hermitian, with eigenvalues 

«1 < «2 < • • • < Ojv, 

then C is a rank-/c compression of M iff C is Hermitian with eigenvalues bj 
satisfying 

o,\ < bi < ajv-fe+i, a 2 < b 2 < a^-k+21 ■ ■ ■ , a k < &fc < &n- 



In particular, C is a rank N — 1 compression iff 

ai <bi < a 2 <b 2 < 03 < ... < Oat-i < &jv-i < a>N, 
the classic "interlacing" of eigenvalues. 

A much more recent example is provided by the theory of higher— rank 
numerical ranges. The striking development of this theory was motivated 
originally by problems in quantum information theory. Since the introduc- 
tion of this concept by Choi, Kribs, and Zyczkowski [CKZ1,CKZ2] only a few 
years ago, it has indeed been effectively applied in the area of quantum in- 
formation (see [CPMSZ,KPLRdS,LP,LPSl,MMZ], for example). It has also 
inspired a remarkable development of its purely mathematical aspects (see, 
for example, [CHKZ,CGHK,Wo,LS,LPS2,DGHPZ]). From this point of view 
the theory of the higher-rank numerical ranges may be described as a highly 
successful analysis of scalar compressions of arbitrary matrices M G Mat- 
This suggests a more general program: characterize the normal (diagonal) 
compressions of M. In what follows we begin to carry out this program, 
although at present the program in its entirety seems out-of-reach. 

The rank-/c numerical range of M, usually denoted in the literature by 
Afc(M), was defined by Choi, Kribs, and Zyczkowski as the set of those 
complex A such that for some rank-/c orthogonal projection P we have 

PMP = \P. 

In terms of compressions, we see that A G Afc(M) iff Xlk is a (matrix) com- 
pression of M. Thus the following fundamental result of Li and Sze [LS] may 
be placed in the same family as the Cauchy interlacing theorem (and, in fact, 
the interlacing theorem plays a role in the argument of Li and Sze). 

Theorem 2: Given M G M^r, let Xj(9) be an enumeration of the eigenvalues 
of the (Hermitian) 

Re(e w M) = (e w M + e~ i6 M*)/2 

such that 

\ 1 (9)<X 2 (9)<---<\ N (9). 

For each real 9, let the half-plane H(M, 9) be defined by 
H(M,9) = e w {z : Re(*) < A^_ fc+1 (-^)}. 



Then 

A fc (M) = f]{H(M, 6):6e [0, 2vr]}. (1) 



Our more general program seeks to describe all normal compressions of 
M, ie to describe those complex oi, . . . , a& such that diag(ai, . . . , a&) is a 
compression of M. Equivalently, we ask when there exist orthonormal 

u 1 ,u 2 ,...,u k 

such that (Mui,Ui) = Oj for each i and (Mui,Uj) = whenever i ^ j; in 
particular, Ai(M) is nothing but the classical numerical range 

W(M) = {(Mu,u) : ||u|| = 1} 

(hence the "higher-rank numerical range" terminology). In this work we 
usually restrict our attention to the case where M itself is also normal, al- 
though we occasionally comment on cases where either M or its compression 
may not be normal. 

Note that for normal M G Mjy(C) Theorem 2 shows that A&(M) can be 
explicitly described in terms of the eigenvalues zi, . . . , z^ of M: 

A fc (M)= f| cowrizj : j e J}. (2) 

#{J)=N-k+l 

We shall refer to this result, first proposed by Choi, Kribs, and Zyczkowski, 
as the CKZ conjecture, although it is now a theorem. The CKZ conjecture 
played an important role in the development of the theory of higher-rank 
numerical ranges. For example, while Li and Sze gave an effective description 
of Afc(M) for non-normal M (Theorem 2), their proof of the CKZ conjecture 
was a key step towards the general result. Of course, the case k = 1 of ([2]) is 
easy and well-known: for normal M, W(M) = conv{zi, . . . , z^}. 

The following observation is often useful. 

Proposition 3: For every M G Mjv, if k < N, C is a rank-fc compression of 

M, and Q is a compression of rank N — k + 1, then 

W(C)nW(Q)^Q. 



Proof: Let S and T be the subspaces corresponding to compressions C 
and Q. Since the dimensions add to more than N, S and T must intersect 
non-trivially; let u be a unit vector in S (IT. Then 

(Mu,u) = (Mu,P s u) = (P s Mu,u) = (Cu,u) G W{C), 

and similarly (Mu,u) G W(Q). QED 

Applying this observation to the normal case, we see that part of the CKZ 
conjecture is straightforward. 

Proposition 4: If M G M^r is normal with eigenvalues Z\, . . . , z^, and the 
rank-fc compression C is normal with eigenvalues C\, . . . , C&, then for every 
index set J having #(J) = N — k + 1 



convjci, . . . , Ck} fl convjzj : j G J} 7^ 



In particular, 



Afc(M) C p| couv{zj : 3 e J} 

#(J)=N-k+l 

(compare (J2J)). 

Proof: We have noted that for normal (finite-dimensional) operators the 
numerical range is just the convex hull of the eigenvalues. Thus W(C) = 
conv{ci, . . . , Ck}- On the other hand, let Q be the compression to the span 
of eigenvectors corresponding to {zj : j G J}; then Q is normal and W(Q) = 
conv{zj : j G J}. Apply Proposition 3. In particular, for points A G Afc(M) 
we may let c\ = c 2 = • • • = Ck = A. QED 

On the other hand, the fact that Afc(M) completely fills the RHS of (J2J) is 
more subtle, in general, although for certain combinations of A^ and k it is 
relatively easy to see. To illustrate this, and to introduce the preoccupations 
of the present paper, consider the case N = 5, k = 2. In Figure 1 we see the 
eigenvalues z t , . . . , z 5 of a normal (in fact, unitary) M as the outer points 
of the blue pentagram. It is easy to see that (J5J) implies that A 2 (M) is the 
inner pentagon. As far as we know, there is no simple proof that A&(M) fills 
this pentagon, but three markedly disparate arguments may be found in the 
literature: 

(1) in [CHKZ] there is an argument based in part on topological concepts 



such as simple connectivity and winding number; 

(2) as it is easy to conclude (see section 2) that the vertices of the inner 
pentagon are in Kz{M), the fact that (whether or not M is normal) A&(M) 
is convex (see [CGHK] and [Wo])) - a striking extension of the classical 
Toeplitz-Hausdorff Theorem for W(M) - may be used; 



(3) as we have noted, 
Theorem 2. 



is a direct consequence of the Li and Sze result 
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Figure 1: Choosing a (red asterisk) at random in A 2 (M) (the inner pentagon), 
we see that B(a) includes a "starfish" that covers A 2 (M) and more. 



A fourth, and quite different yet again, approach can be obtained by consider- 
ing those eigenvalue pairs a, b that can belong to rank-2 normal compressions 
of M. Given a G C we denote by B(a) the set of b that match a in this sense. 



We shall prove in section 3 that for a in the inner pentagon B(a) includes a 
"starfish" (outlined in green for the example of Figure 1) covering the (filled) 
pentagon (our conjecture, in addition, is that the starfish is precisely B(a)). 
Since a G B(a) says that a G A2(M), we conclude once again that Aa(M) 
fills the pentagon. 

Plan of the paper: section 2 has some general results, section 3 treats the 
case k = 2, section 4 examines continuity of B(-), and section 5 discusses 
non-normal compressions. 

Acknowledgements: We have enjoyed many stimulating discussions of ma- 
trix compression, particularly those with M.-D. Choi, C.-K. Li, Y.-T. Poon, 
N.-S. Sze, and J. F. Queiro. Versions of the material in this paper were 
developed in [M]. The work of Holbrook and Pereira was supported in part 
by Discovery Grants from NSERC of Canada. 

2. Some general results (arbitrary k, N) 

Note that if C is a rank-A; compression of M G M^r and C is a rank-/c' 
compression of C, then C is a rank-fc' compression of M. Thus Proposition 
3 has the following consequence. 
Proposition 5: If C is a compression of M G M^v then 

W(C) C W(M). 

Proof: Regard z G W(C) as a rank-1 compression C of C, hence of M and 
apply Proposition 3 with k — 1, C replaced by C and Q = M. QED 

Whereas Proposition 4 supplies a necessary condition on the eigenvalues 
Ci, . . . , Cfc of a normal compression C of normal M, the following proposition 
points out a sufficient condition that is sometimes useful. An interesting 
analysis of such necessary vs sufficient conditions may be found in [QD] . 
Proposition 6: If M G M^r is normal with eigenvalues zi,...,zn then 
c\ , . . . , Cfc G C are eigenvalues of a normal compression C of M provided 
that there exists a partition Ji, . . . , J& of {1,2,..., A^} such that for each 
i — 1, . . . , k 

Ci G conv{zj : j G JJ. 

Proof: For each % let q = X] 7 eJ ^i^i re P resen t Q as a convex combination. 
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Let Mi, ... , un be an orthonormal basis of eigenvectors for M, with 



For each i, let 



Muj = ZjUj. 



Wi = J2VtiJ u J- 



It is easy to check that Wi, . . . , w k are orthonormal , that (Mwi, Wj) = Ci, and 
that (Mwi, Wh) — if h ^ i. It follows that C = diag{ci, . . . , c k } represents 
the compression of M to the subspace S = span{wi, . . . , Wk}, ie 

C = P S M\ S . 

QED 

In [CKZ1] Choi, Kribs, and Zyczkowski identified explicitly the higher-rank 
numerical ranges of Hermitian matrices, and their argument may be viewed, 
along the lines of the proof of our next proposition, as an illustration of 
the combined force of the necessary condition from Proposition 4 with the 
sufficient condition from Proposition 6. Note that the result might also have 
been obtained as a special case of the Fan-Pall result, Theorem 1 (taking 
b 1 = b 2 = ■■■ = h). 
Proposition 7: If Me M^r is Hermitian with (real) eigenvalues 

then for each k < N/2 we have 

Afc(M) = [a k ,a N -k+i]- 

If a N -k+i < Ojk, then A K (M) = 0. 

Proof: If A G Afc(M) then taking c\ — ... — c& = A in Proposition 4 we see 

that 

A G conv{a fc , . . . , a N } = [a k , a N ]. 

Likewise, A G [ai,ojv-fc+i], so that A k (M) C [a,k,(iN-k+i]- 
On the other hand, considering the partition of {1, ... , iV} into 
J 1 = {l,N},J 2 = {2,N-l},...,J k = {k,N-k + l} 
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we conclude from Proposition 6 that each AG [o^, a^^k+i] is i n Afc(M). QED 

As another example of such general arguments we treat the normal compres- 
sion problem for the case k = N — 1. This result goes back to Fan-Pall [FP]; 
their proof is algebraic in character whereas ours is more geometric. We 
restrict to the case where the matrix and its compression have no common 
eigenvalues since this is where our general principles are most pertinent; Fan 
and Pall also treat the general case by means of a direct sum construction. 
Proposition 8: Let zi, . . . , zn and c±, . . . , cn-i be two collections of complex 
numbers having no elements in common. Then there is a normal M G M^v 
with eigenvalues Zj having a rank-(iV — 1) normal compression C with eigen- 
values Cj iff the Zj are collinear and alternate with the Cj (in some order) 
along the common line. 

Proof: Let us first show that if such M, C exist then the Zj must be collinear. 
Label the Zj lying on the boundary of W(M) in counterclockwise order: 
zi, . . . ,z p . If the Zj are not collinear there must be some Zk-i, Zk, Zk+i that 
are not collinear, as in Figure 2. Proposition 4 requires that [2fc_i,2fc] meets 
W(C) at some A closest to z k ; this A is extreme in W(C) and so must be an 
eigenvalue of C. Similarly we have an eigenvalue /x of C in [zk,Zk+i], as in 
Figure 2. Note that Proposition 4 also tells us that Zk cannot be a repeated 
eigenvalue of M, since it would then coincide with an eigenvalue of C. 

Let Mi, ... , Mat be an orthonormal set of eigenvectors of M, with Muj = ZjUj, 
and let orthonormal v, w be eigenvectors of C with Cv = Xv and Cw = fiw. 
Expand v, w in terms of the uf 

N N 

j=l 3=1 

then 

N 

A = (Cv,v) = (Mv,v) = ^2\aj\ 2 Zj, 

3=1 

so that Qj = unless Zj lies on the line through Zk-i,Zk- Similarly bj = 
unless Zj lies on the line through Zk, Zk+i- Since Zk is the only common point, 

= (v,w) = a k b k . 

If a*; = we have A = Zk-i, which we have ruled out, while if bk = we have 
jj, = Zk+i, also ruled out. 



_ k-1 




"k+1 



Figure 2: An example of the eigenvalue geometry ruled out in the proof of 
Proposition 8. 



Thus the eigenvalues all lie on a common line and by an affme map M — y 
aijv + fiM this common line can be R, ie we are in the Hermitian case. 
Proposition 1 then completes the argument, giving the interlacing property. 



On the other hand, if the collinearity and interlacing conditions are met, the 
same sort of affine map and Proposition 1 establish the existence of M and 
C. QED 



3: Results for k — 2 and small N 

For 2x2 normal compressions diag(a, b), we can give a more detailed account 
of the afr-geometry, leading up to an understanding of the "starfish" seen in 
Figure 1. 

Recall that, given normal M G M^r and complex a, we denote by B(a) the 
set of complex b such that diag(a, b) is a compression of M. Of course, in 
order that B(a) should be nonempty we must have 

a G conv{z l7 z 2 ,. . -,z N }, 

where the Zj are the eigenvalues of M. Note that Proposition 4 also requires 
that for b G B(a) we require that the line segment [a, b] intersect 

coiw{zj : j 7^ i} 

for each % — 1, . . . , N. 

The simplest case to consider: N = 3 and the eigenvalues of M form a 

nontrivial triangle. 

Proposition 9: Suppose that the eigenvalues Zi,Z2,z^ of normal M G M 3 

are not collinear. Then b G B(a) iff either a is one of these eigenvalues, say 

a — z\ and b G [z2, 23] (the opposite side of the triangle formed by zi,Z2,zs) 

or a is in one of the sides, say [z 2 , £3], and b — z\. 

Proof: Since [a, b] must meet each of the triangle's sides, the necessity of 

the condition is clear. On the other hand, Proposition 6 shows that these 

conditions suffice for a, b to be the eigenvalues of a normal compression. QED 

Remark: Here we have a very simple case of the result of Fan and Pall [FP] 

where they characterize in general the case k — N — 1. 

When iV = 4 we encounter more complex behaviour, such as that seen in 
Figure 3, where B(a) is a curve interior to conv{^i, z 2 , z%, Z4} (except for 
endpoints). 



10 



0.8 



0.6- 



0.4- 



0.2- 



-0.2 



-0.4- 



-0.6 



-0.8- 




-1 



-1 -0.8 -0.6 -0.4 -0.2 



0.2 0.4 0.6 0.8 



Figure 3: For a (red asterisk) strictly inside the upper quadrant (case (a)), 
we see that B(a) is a curve in the opposite quadrant. 

To analyse such behaviour, it will be convenient to assume in what follows 
that the eigenvalues of M are generic in the sense that no three are collinear. 
We may also assume that M = diag(zi, . . . , Zjv), so that the eigenvectors of 
M are the standard basis vectors Cj. 

Note that if b G B(a) we have orthonormal u, w such that 

(Mm, u) = a, (Mw, w) = b, and (Mu, w) = (Mw, u) = 0. 

Thus a — 5^i \ u j\ 2z ji & convex combination. Let Ajv denote the iV-dimensional 
simplex, ie conv{ei, . . . , e^}] then \u\ 2 (where the operations are performed 
componentwise) belongs to 

JV 



C(a) = {t G A N : a = Y^tjZj}. 
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By exchanging complex arguments between the components of u and w we 
may assume that u > 0; then the possible u lie in {y/t : t G C(a)}. The 
conditions on w G C^ are then given by 

||w|| = 1, w _L u, w _L z o u, and w -Lz o u, 

where o indicates Schur (componentwise) multiplication, so that 

ZOU = (z 1 u 1 ,...,z N u N )', 

with ' indicating transpose. 

We may thus describe B(a) as follows. 

Proposition 10: Given a G W(M)(— convj^, . . . , z N }), 

B(a)= |J B(a,t), 
tec (a) 

where 

N 

B(a,t) = {S^ \ w j\ 2z j '■ \\ w \\ — 1) w -L v^i z ° Vt,z ° Vt}- 
i 

Proof: To the discussion above we need only add the observation that 

A? 



b= (Mw,w) = Y^ 



Wj\ 2 Zj. 



QED 



Clearly C(a) is a compact convex subset of A^. It is therefore the convex 

hull of its extreme points, which are identified in the following result. 

Proposition ll:The extreme points of C(a) are those t G C(a) such that 

at most three £& > 0. 

Proof: Consider t G C(a) such that £& > for at least four values of k. We 

show that t is not extreme. For convenience assume ti,t2, £3, £4 > 0. The 

space 

X = {x G R N : a; fc = for k > 4} 
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is 4-dimensional. Hence 

4 



7 = {xGl:^x fc = 0,^ x k Re{z k ) = 0, ^ x fc Im(« fc ) = 0} ^ {0}. 



i 



Let O^i/GV. Then for sufficiently small e > we have t ± ey G A^ and 

y~fo ± ey)fc« fc = ^2 tkZk = a ' 

k k 

so that £ ± ey G C(o). Hence £ is not extreme. 

On the other hand, if at most three components, say t 1 ,t 2 ,t 3 of t G C(o) 
are positive, and t is the average of t',t" G C(o), then t' k ,t'l = for k > 3. 
Because no three Zj are collinear, 

a = t\Z\ + t 2 -2 2 + t 3 z 3 

is the unique representation of a as a convex combination of zi,z 2 ,z 3 . Hence 
i! = t" = t. QED 

For distinct indices i,j, I, let t(i,j, I) denote the element of C(a) (if it exists) 
such that tk(i,j,l) = whenever k ^ i,j,l- Note that such elements are 
uniquely determined since 

a = U(i,j,l)zi +tj(i,j,l)zj +ti(i,j,l)zi 

represents a uniquely as a point in the triangle conv{zj, Zj, zi}; here again we 
use the assumption that no three of the eigenvalues Zj are collinear. Thus 

C(a) = conv{t(i,j,l) : i,j,l are distinct and a G conv{^, Zj, zi}}. (3) 



The complexity of B(a,t) increases with the number of nonzero t k . For 
example, if only one t^ > 0, then t k — 1 and a = z k . Here the simple 
sufficient condition of Proposition 6 is also necessary: 

B(a,t) = conv{zj : j ^ k}. 
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We see this as follows. Evidently, with u = \/t = e^, u,w are orthonormal 
exactly when w = Ylj+k a j e j with Ylj^k \ a i\ 2 = lj then 

b = (Nw,w) = y. \ a j\ 2z j ^ conv{zj : j ^ k}, 

3+k 

and any b G conv{zj : j ^ k} can be obtained in this way. 

The same sort of simplification occurs if only two or three tk > 0. 
Proposition 12: (a) If t G C(a) has exactly two positive components, say 
ti, t 2 > 0, then 

B(a,t) = conv{,2j : j > 2}. 

(b) If t G C(a) has exactly three positive components, say ti,t 2 ,t 3 > 0, then 

B(a,t) = conv{zj : j > 3}. 
Proof: (a) Since a G conv{zi, Z2}, Proposition 6 tells us that 

B(a,t) D conv{zj : j > 2}. 

On the other hand, with u — y/i — {\/ti, Vh, 0, . . . )' we see that u,w are 
orthonormal iff \\w\\ = 1 and (wi,^) _L (Vti,Vh)', similarly (Mu,w) = 
only if (wi,W2) -L (y/tizi, y/t^z-i). Since Z\ 7^ z 2 , we have W\ = w 2 = so 
that 

b = (Mw,w) G convj^j : j > 2}. 

(b) Since a G conv{z 1 ,z 2 ,z 3 }, Proposition 6 tells us that 

B(a,t) D conv{zj : j > 3}. 
On the other hand, with u = \/t we have u, w orthonormal iff \\w\\ = 1 and 

(wx, w 2 , w 3 ) _L (v 7 ^, Vh, Vh) 
and (Mu,w) = (Mw,u) = only if 

(twi, u> 2 , w 3 ) _L ( v / t7Re(zi), v / t 2 "Re(z 2 ), VhRe(z 3 )), (Vt 1 lm(z 1 ) : y/hlmfa), Vt^lm(z 3 )). 
Since 21,2:2,23 are not collinear, 

(1,1,1), (Re(2i),Re(2 2 ),Re(2 3 )), (Im(2i),Im(2 2 ),Im(2 3 )) 
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are linearly independent. We must have W\ = w 2 — w 3 — so that b = 
(Mw,w) G conv{zj : j > 3}. QED 

We are now in a position to understand the features of Figure 3 and, indeed, 
to analyse all the possibilities when N — 4. We treat in detail the case where 
z±, z 2 , z 3 , z± are all extreme in conv{zi, z 2 , z 3 , z^}; the case where one of the 
eigenvalues lies in the interior of W(M) (eg -2 4 G conv{zi, z 2 , z 3 }) can be 
treated similarly. 

Proposition 13: Let N = 4 and suppose that z\, z 2 , z 3 , Z4 are all extreme 
in W(M) and are numbered in counterclockwise order. The diagonals [-21,-23] 
and [-22,-24] meet at q and divide W(M) into four quadrants. Consider a G 
W(M); the possibilities for B(a) are as follows. 

(a) See figure 3: a lies in the interior of one of the quadrants. For convenience, 
assume that a G conv{zi, z 2 ,q}; let x = £(1,2,3), y = £(1,2,4). Then B(a) 
is the curve traced out by the function b(r) defined for < r < 1 by 

Kr) = Y, {Xk ; Vkf zjy ■ {Xk ~ yk)2 . 

^{l-rjxk + ryk I ^ (1 - r)x k + ry k 
Note that rc 4 = and y 3 = so that 

limfr(r) = -2 4 , lim6(r) = z 3 , 

r^O r-tl 

and we obtain a continuous curve parametrized on [0, 1] when we interpret 
6(0) as ,24 and 6(1) as z 3 . Except for these endpoints, the curve lies in the 
interior of the opposite quadrant conv{-2 3 , -2 4 , q}. 

(b) If a lies in the interior of one of the sides of W(M) then B(a) is the 
opposite side (eg if a is inside [-21,-22] then B(a) = [-2 3 ,-2 4 ]). If a — z k then 
B(a) is the opposite triangle conv{-2j : j 7^ k}. 

(c) See Figure 4: a lies interior to the diagonals but is not q; say a is interior 
to [-2i, q]. Then B(a) is the T-shaped object [-22,-24] U [5,-23]. 

(d) If a = q then B(a) is the union of the two diagonals. 
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Figure 4: For a (red asterisk) strictly inside the segment [z\, q] (case (c)), we 
see that B(a) is the T-shaped object consisting of [z 2 , z$] U [q, z 3 ]. 



Proof: (a) Since a lies in the triangles convj^i, z 2l £3} and convjzi, z 2 , £4} 
but in no other triangle of eigenvalues, C(a) = [£(1,2, 3), £(1,2, 4)] = [x,y] 
(recall the relation (J3J)). For < r < 1 consider the £ G C(a) given by 
£ = (1 — r)x + ry. We shall see that B(a, £) consists of the single point b(r). 
We take u = \ft and note that the conditions on w are: w _L u, w ± uoRe(z), 
w _L uolm(z), and ||iy|| = 1. Thus woy/t _L I4, Re(,2), Im(z), where I4 denotes 
[1, 1, 1, 1]. Again we invoke linear independence of 1 4 , Re(z), lm(z): w o \/t 
lies in the one-dimensional space 

dC 4 span{l4, Re(^), hn(z)}. 

There is a natural choice of (nonzero) vector in this space: x — y (because 
(x, I4) = (y, I4) = 1, (x,Re(z)) = (y,Re(z)) = Re(a), and (x,Im(z)) = 
(y,Im(z)) = Im(a)). Thus 

w = a(x — y)j o y/t, 
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where /o indicates entrywise division and a is some complex number. Re- 
calling that ||w|| = 1, we derive our formula for (Mw,w) = b(r). 

The necessary condition of Proposition 4 shows that the curve (ie B(a)) 
lies in both conv{22, -23, -24} and conv{zi, 23, 24}, so that it must lie in the 
(closed) opposite quadrant conv{2 3 , 24, q}. To see that the curve (except for 
endpoints) lies in the interior of that quadrant, examine the arguments be- 
low, showing that for b on the quadrant boundary (except for 23 and 24) a 
matching a cannot be interior to the upper quadrant, and note that b € B(a) 
iff a e B(b). 

(b) If a is interior to one of the sides, say [21,22], then C(a) consists of 
a single £ with two positive components; apply Proposition 12(a) to see 
that B(a) = B(a,t) = [23,24]. If a = Z\, Propositions 4 and 6 imply that 
B(a) = conv{2 2 , z 3 , 24}. 

(c) Suppose a is interior to [z\, q}; then the relation (J3J) tells us that 

C(o) = conv{£(l, 2, 3), t(l, 3, 4), t(l, 2, 4)}. 

Let £(1,2,3) = x = [x u 0,x 3 , 0]'; this is also £(1,3,4). Let £(1,2,4) = y = 
[?/i, 1/2, 0,3/4]', so that C(o) = {£(r) : < r < 1}, where 

£(r) = [(1 - r)xi + rj/i, ry 2 , (1 - r)a? 3 , rj/ 4 ]'- 

For r = 0, Proposition 12(a) tells us that B(a, £(0)) = [22,24], while for 
< r < 1 we claim that B(a,t(r)) is a single point 6(r) that moves along 
[23, q), covering it completely. Indeed, reasoning as in (a), we see that b(r) = 
(Mw(r),w(r)) where w(r) is a normalized version of 

(£(0)-£(1))/ OV ^). 

Note that w 2 (r), w±(r) are proportional to — jfe/x/n/i, —Hi/ y/HM respectively, 
so that 

|w2( r )| 2 __ 2/2 

\wi{r)\ 2 2/4' 

Since a = y\Z\ + 2/2^2 + 2/4-24 lies on [z\, z 3 ], we conclude that b(r) G [21, 23] 
also. The necessary condition of Proposition 4 then tells us that b(r) e [23, q\. 
Since £2(7") and £4(r) tend to as r — » 0, lim r .^ ^( r ) = ?■ Moreover, £(1) = 
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[1/1,1/2,0,1/4] so that Proposition 12(b) implies that 6(1) = z 3 . Finally, since 
b(r) is continuous over < r < 1, its values cover [z 3 , q). 

(d) This case may be treated by an argument rather similar to that of (c). 
QED 

We now have the tools to continue the theme of Proposition 12, treating the 
case when exactly four of the components of t G C(a) are positive. 
Proposition 14: Suppose that N > A and that t e C(a) has exactly four 
positive components; for convenience, assume that ti,t2, £3, £4 > and that a 
lies in the upper quadrant relative to Q = conv{zi, z 2 , z 3 , £4}, ie a is interior to 
conv{zi, z 2 , q} (see Figure 3, with the understanding that it is now intended 
to show only the relation of a to Z\, z 2 , z 3 , z±, and Proposition 13). Let ft be 
the curve traced out by b(-) of Proposition 13(a) (and shown in Figure 3). 
Then 

B(a, t) = conv{/3, z 5 , z 6 , . . . , z N }. 

Proof: With u = \/i, we see that the conditions on w, namely 



ffil«,Mo Re(z), u o lm.(z) and ||w|| = 1, 

reduce to 

w _L u,uoRe(z),uoIm(z), 

where w = (wi,W2, 11)3,104)', u = (ui, 112,113, 1x4)' etc, and 

IHI 2 + $^KI 2 = i- 

k>4 

Thus w/||w|| is subject to the same conditions as w in the proof of Proposition 
13(a). It follows that 

(Mw,w) = \\w\\ 2 b(r) + y^ \wk\ 2 Zk 

k>4 

where b(r) can be any point on the curve /3. QED 

Proposition 14 allows us to understand, in large part, the phenomenon il- 
lustrated in Figure 1. Let N = 5 and suppose that each eigenvalue z^ is 
an extreme point of W(M) = conv{zi, . . . , z$} (eg whenever M is unitary). 



For convenience, label the Zk in counterclockwise order. Suppose that a lies 
strictly inside the central pentagon (which is known to be A 2 (M) in this 
case). For each k let (3k denote the curve obtained as in Proposition 14 by 
regarding a as an element of the quadrilateral Qk = convjzj : j ^ k}. Note 
that (3k connects Zfc+2 and Zk+3 (numbering modulo 5) and lies in the quad- 
rant of Qk opposite to the one containing a. We claim that (as illustrated in 
Figure 1) B(a) includes the whole "starfish" region bounded by /?i, /3 2 , . . . , /V 

To see this note that the starfish is the union of the wedges Wk = conv{(3k, Zk}, 
so it suffices to show that each Wk C B(a). Since a G Qk there is t G C(a) 
such that tk — 0. Then Proposition 14 tells us that B(a, c) = Wk- 

Figure 1 was obtained by first computing C(a) via the relation (|3J as 

conv{£(£;, k + 2, k + 3) : k = 1, 2, . . . , 5} 

(note that for a in the inner pentagon, the only eigenvalue triangles containing 
a correspond to the triples Zk, Zk+2, z k+z)- To generate each of the thousands 
of 6's in B(a), plotted as green points in Figure 1, our MATLAB program 
first chose a "random" point t G C(a) (ie a random convex combination of 
the five c(k, k + 2,k + 3)), put u = \/i, then computed b = (Nw, w) where w 
was chosen "randomly" in 

C 5 span{w, u o Ke(z),u o lm.(z)} 

(and normalized so that ||iu|| = 1). The curves (3k were added using the 
formula of Proposition 13(a). Such simulations strongly suggest the following 
"starfish conjecture", since no green dots fall outside the starfish: in such a 
situation (and in particular when N = 5 and M is unitary), B(a) not only 
contains the starfish but is equal to it. 

We have seen in the discussion of Figure 1 that for N = 5 and a,b G A 2 (M) we 
always have a, b as eigenvalues of a normal compression of M. The following 
proposition points out that this is true for any A^ - and that A^ = 5 is, in 
fact, the only subtle case. 

Proposition 15: Let M be normal in M.^ and such that the eigenvalues 
z±, . . . ,Zn are distinct and each is an extreme point of W(M) (eg M unitary). 

"a 0" 



Then a, b G A 2 (M) implies that 



b 
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is a compression of M. 



Proof: For N <3, A 2 (M) = 0. For even N > 4, the relation © tells us that 
A 2 (M) is the "inner iV-gon" cut off by the line segments [zj,Zj+%] (indexing 
modulo N). Thus for even A^ > 4 

A 2 (M) = conv{zj : j odd} fl convjz.,- : j even}, 

and Proposition 6 suffices. For N = 5 the "starfish" discussion proves 
our assertion. For odd A^ > 7 we see that conv{zj : j odd} D A 2 (M) 
and convjzj : j even} covers all of A 2 (M) except that part lying in Q = 
conv{zi, z 2 , Zn-i, zjy}. Hence Proposition 6 suffices for a $ Q,b G A 2 (M). 
The same argument applies for a ^ Q = conv{z 2 .z 3 , z 4 , z 5 } and because 
A^ > 5 this covers any oeQ. QED 

4. Continuity of B(-) 

A natural assertion of "continuity" for B(-) might be that d,H(B(a'), B(a)) — > 
as a' — > a, where dn{X, Y) is the Hausdorff distance between compact 
nonempty sets X, Y C C. Recall that 

d H {X,Y) = max{d H (X,Y),d H (Y,X)}, 

where 

dn{X, Y) = max(min \x — y\). 

However, we have seen simple examples where this fails: recall the analysis 
of B(a) for various a e conv{z l5 z 2 ,z 3 ,z i } that was provided by Proposition 
13. If a' lies in the interior of [zi, z 2 ] and a' — >■ a = z±, then B(a') = [z 3 , z 4 ] 
"jumps" to B(a) = conv{z 2 , z 3 , z^}. A perhaps more surprising example: let 
a be interior to [zi,q] as in Figure 4; for a' approaching a from the interior 
of convj^i, z 2 , q} we see B(a') as a curve joining z 3 and z 4 in convj^, z±, q}, 
whereas for a' approaching a from the interior of convex, z 4 , q] we see B(a') 
as a curve joining z 2 and z 3 in conv{z 2 , z 3 , q}. 

In spite of such "failures" we'll show that B(-) is continuous with respect 
to Hausdorff distance at most points of W(M) and enjoys a "one-sided" 
Hausdorff continuity in general. 

Our standard set-up for this discussion is as in section 3, ie we assume M 
is normal in M^v and is in diagonal form: M = diag(z), where no three 
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eigenvalues are collinear. Thus W(M) = conv{zx, . . . , z^} and B(a') = if 
a' $l W(M). Seeking continuity, we restrict attention to a' — > a with a', a G 
W(M). Note that if JV = 3 and a' is interior to W{M) = conv-{>i, z 2 , z 3 }, 
we again have B(a!) = 0, since b G B(a!) and Proposition 4 would require 
that [a',b] meet each side of the triangle W(M). We therefore restrict also 
to cases where N > A. 

Proposition 16: If N > 4, B(a) is a compact nonempty set for any a G 

W{M). 

Proof: Let £ G C(a). Since JV > 4, 

C^ spanjv^, Vi o Re(», v^ o Im(z)} 

is nontrivial (=^ {0}). Let w be a unit vector in this space; then 6 = 
(Mw,w) G S(o,t), so 5(a) ^ 0. 

For compactness, consider b n G -B(a); there exist orthonormal pairs u n ,w n 
such that 

(Mu n ,u n ) = a, (Mw n ,w n ) = b n , (Mu n ,w n ) = (Mw n ,u n ) = 0. 

Since the sequences u n ,w n are bounded, local compactness in C 2N implies 
that, for some subsequence nk, 

U nk ->k U, W nk -> fc W. 

Then u, w are orthonormal and 

(Mu, u) = a, (Mw, w) = lim b nk = 6, (Mu, w) = (Mw, u) = 0. 

k 

The limit point b is in B(a). QED 

A related argument shows that, in general, B(-) is continuous in a one-sided 

Hausdorff sense. 

Proposition 17: If a, a n G W(M) and a n — > a, then 

^(5(0,^(0)) -> n 0. (4) 

Proof: Recall that dn(X, Y) = max a . e x(min yG y \x — y\). Thus, if (jlj) were 
to fail we'd have some e > 0, subsequence n^, and 6 fc G B(a nk ) such that for 
all 6 G S(o) 

|6 fc - b\ > e. 
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By restricting to such a subsequence we may assume that b n G B(a n ). Let 
u n , w n be orthonormal pairs such that 

{Mu n , u n ) = a n , {Mw n , w n ) = b n , (Mu n , w n ) = {Mw n , u n ) = 0. 

There is a subsequence n^ such that 

U nk ~^k U, W nk -¥ k W. 

Hence u, w are orthonormal and 



(Mu,u) = lima nfe = a, (Af 



w, w) 



limb. 



»k 



(Mu,w) = (Mw,u) 



0. 



It follows that b = lim^ b nk G B(a), contradicting \b nk — b\ > e. QED 



In terms of the obvious extension of Hausdorff distance to compact nonempty 
subsets of A N , we note that C(-) is continuous and in fact satisfies a Lipschitz 
condition for each fixed M. 

Proposition 18: There is a constant K < oo depending only on M such 
that for all a, a' G W(M) 

d H (C(a),C(a')) <K\a-a!\. 

Proof: For each triple i,j,k of distinct indices, we have assumed that 
Zi,Zj,Zk are not collinear. Thus the matrix 



T 



1 1 1 

Re(zi) Re(zj) Re(2fc) 
Im(^) Im(^) Im(z fc ) 



is nonsingular. Given a G conv{zj, Zj, Z).}, consider t^ = t(i,j,k) as in <^. 
Let tijk be the vector in R 3 recording the z, j, k -components of tijk, ie the 
only components that may be positive. We have Ttijk = (1, Re(o), Im(o))' so 
that tijk — T _1 (l, Re(a), Im(a))'. In terms of the operator norm ||T _1 || we 
have 

Mi i / II ^s- 1 1 rri — 1 III / | 

\\Hjk — HjkW — W 1 \\\ a ~ a I 

for any other a' G conv{zi, Zj, Zk}- Let K be the maximum of ||T _1 || over all 
such triangles convj^, Zj, Zk}. 

The line segments [zi, Zj] form a "grid" criss-crossing W(M), dividing it into 
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regions. Suppose a, a' lie in the same one of these regions (boundary points 
allowed). Then the set Q of triples i,j,k such that a G conv{zi,Zj,z k } is 
the same as that for a'. In view of ([3]), each t G C(a) can be expressed as a 
convex combination 

ijkGQ 

Putting 

^ = / 4 S ijktijki 

we have t' G C(a') and ||£ — £'|| < K\a — a'\. The roles of a, a' may be reversed, 
so we see that if a, a' are in the same region (boundary points allowed), 

d H (C(a),C(a)) <K\a-a'\. 

Finally, for any a, a' G W(M), the line segment [a, a'] intersects the grid in 
a sequence of points a , oi, . . . , a n ordered along [a, a'} with a = a, a n = a ' ■ 
By the argument above, 

d H (C(a k ), C(a k+1 )) < K\a k - a k +i\, 

so that (d,H is a metric) 

71-1 

d H (C(a), C{a')) <Kj2\ a k~ a k+1 \ = K\a - a'\. 

k=0 

QED 

Next we show that B(-) is ^-continuous at any point that is "off the grid", 
and that continuity is uniform if we stay bounded away from the grid. 
Proposition 19: If a G W(M) but a does not lie on any line segment [zi, Zj], 
then a! —)■ a implies that 

d H (B(a'),B(a)) -»> 0. 

In fact, on any subset S(d) C W(M) that is a positive distance d from the 
grid 

G = \J{[z i ,z j ]:i,j = l,...,N}, 

so that 

S(d) = {ae W{M) : min \a - g\ > d}, 

geG 
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the map a \— > B(a) is uniformly continuous. 

Proof: In this discussion i,j,k always denotes a triple of distinct indices. 

Let 

Q = \J{C(a) : a G S(d)}; 

we claim that 

mm(ma,x t it jtk) 

£6Q i,j,k 

is positive. Otherwise, by compactness, we'd have some a G S(d) and 
t G C(a) such that maxjj^ tit jtk = 0. This can only happen if t has at 
most two positive components, say U,tj; then a G [zi,Zj], which we have 
ruled out. 

Given linearly independent q,r,s G C N , let P(q,r, s) denote orthogonal pro- 
jection onto 

C N span{g,r, s}. 

The map (q, r, s) h-> P(g, r, s) is uniformly continuous if we "stay away from 
dependence"; to be precise, for any < h < H < oo this map is uniformly 
continuous on 



Q(h, H) = {(q, r, s) : \\q\\, \\r\\, \\s\\ < H, max | det 

i,3,k 



Qi 


Qi 


qk 


n 


r j 


rk 


Si 


s .i 


Sk_ 



> h}. 



Now the values (y/t, y/t o Re(z), y/t o lra(z)) where t G Q lie in some fixed 
Q(h, H) because each 



det 



1 1 1 

Re(zi) Re(zj) Re(zk) 
lm(zi) lm(zj) lm(z k ) 



is nonzero, so that 



max | \J Ut jtk det 

i,j,k 



1 1 1 

Re(zi) Re(zj) Re(zk) 
lm(zi) lm(zj) Im(z k ) 



>h 



for some positive h. Thus the map t >->■ P(\/i, \/io Re(^), \/io lra(z)) = P[t] 
is uniformly continuous on Q: given €\ > there is Si > such that t,t' G Q 
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and \\t - if || < Si implies \\P[t] - P[t'}\\ < e x . 

In view of Proposition 18, there is 5 > such that \a — a'\ < 5 implies 
dH(C(a),C(a')) < S±. Consider b G B(a); for some t G C(a) we have b G 
B(a,t) so that b = (Mw,w) for some unit w with P[t]w = w. Let if G C(a') 
be such that ||t — £'|| < S\\ then ||iu — P\t!\w\ < e±. Note that 

1-ei < ||P[tVll ^ !> 
and let w' = P[t'}w/\\P[t']w\\; then b' = (Mw',w') G B(a') and 

|6-6'| = |(Mw,«;)-(Mw / ,w / )l < 2 ||M|| \\w-w'\\. 

It is easy to see that \\w — w'\\ < 2ei/(l — ex), so that given any e > we have 
1 6 — b'\ < e by an appropriate choice of e\. We have shown that \a — a!\ < 5 
implies that d H (B(a),B(a')) < e. Since the roles of a, a' may be reversed, 
we also have d,H(B(a), B(a')) < e. QED 

Note that sometimes B(-) is continuous even at points that are on the grid. 
For example, from Proposition 13(a) and 13(b) we can see that there is conti- 
nuity everywhere on the boundary segments [z iy z i+ i] except at the endpoints. 

5. Related results 

We offer some remarks on the apparently more difficult problem of charac- 
terizing arbitrary compressions of a normal matrix M. Suppose again that 
M is N x N, and is represented by the diagonal matrix diag(z) and that X 
is a rank-A; compression of M, ie there is a /c-dimensional subspace S such 
that X = PgM\s- From Proposition 3 we obtain a necessary condition on 
X: the (classical) numerical range W(X) of X must intersect the convex hull 
of any subset of the eigenvalues Zj having size N — k + 1. 

When k = 2, ie X is represented by a 2 x 2 matrix, the numerical range 
W(X) determines X uniquely as an operator. Indeed, W(X) is a (filled-in) 
ellipse in this case with the eigenvalues of X as foci and the length of the 
minor axis is the modulus of the off-diagonal entry of any upper-triangular 
matrix for X. Let's consider the problem of characterizing such compres- 
sions X geometrically via the elliptical W(X) in the cases where N = 3 and 
N = A. 
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When N — 3, the necessary condition of above tells us that W(X) must be 
tangent to each of the three sides of coiw{zi, Z2, £3} (recall that Proposition 
5 tells us that in general we must have W(X) C W(M) = conv{zj : j = 
1, . . . , n}). In fact, Williams showed long ago that the necessary condition is 
also sufficient when N = 3 (see [Wi]). 

When N = 4 we consider the case where the eigenvalues Zj form a quadri- 
lateral Q. The necessary condition above tells us that W(X) must intersect 
each of the four triangles Tj = conv{zj : j 7^ i}. Thus W(X) must intersect 
each of the quadrants T{ fl T^. This phenomenon is borne out by numerical 
experiments such as Figure 5 illustrates, but it is not clear what additional 
conditions must be satisfied by W(X), even in this N = 4 case. Of course, 
if by chance W(X) is tangent to all three sides of some Tj, then Williams' 
result tells us that X is indeed a 2-dimensional compression. 
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Figure 5: Shows the (elliptical) boundaries of the numerical ranges of several 
(nonnormal) compressions of a 4 x 4 normal M, each compression having 
a (red asterisk) as an eigenvalue (therefore seen as one of the foci of each 
ellipse) 
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